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Abstract 

Statistical estimation and test of unknown channels have attracted 
interest of many researchers. In optimizing the process of inference, an 
important step is optimization of the input state, which in general do 
depend on the kind of inference (estimation or test, etc.), on the error 
measure, and so on. But sometimes, there is a universally optimal input 
state, or an input state best for all the statistical inferences and for all the 
risk functions. In the paper, the existence of a universally optimal state is 
shown for group covariant/contravariant channels, unital qubit channels 
and some measurement families. To prove these results, theory of "com- 
parison of state families" are used. We also discuss about effectiveness of 
entanglement and adaptation of input states. 

1 Introduction 

Statistical estimation and test of unknown channels have attracted interests 
of many researchers. Below, let {Ae} ege be a family of unknown channels, 
where 6 € O is the unknown parameter. In optimizing the process of inference, 
one has to optimize not only the measurement performed upon the output state 
Ag<£>I (pin), but also the input state pi n . (One may also use a process POVM [15], 
operators {M t } teV such that Y^tev = l®tr n R pf n ■ But then one also has to 
optimize tiu R Pi n . Since an optimal input state pi n is a pure state, optimization 
of t~r-H R Pin is equivalent to optimization of pi n .) 

In general, optimal input states depend on whether we are estimating state or 
testing hypothesis about unknown channels; they also depend on error measure, 
and detail of the setting (Bayesian, minimax, unbiased estimation, Neyman- 
Pearson test, etc.). 

In some cases, however, the situation is less complicated. For example, [3] 
deals with estimation of group transform {U g } ge g, where g — > U g is a represen- 



1 



tation of the group Q and g is unknown and to be estimated. They had shown 
that there is an input state which is optimal with respect to any C/-invariant 
loss functions. (In case of Q — SU (d) and U g = g, maximally entangled states 
between the input space and the auxiliary space are optimal.) Meantime, [1] 
treats estimation of SU (2) channel by an unbiased estimator, and 'the loss func- 
tion' here is the mean square error matrix of the estimate 9 of the unknown real 
vector 9 which parameterizes Q = SU(2). Since the space of matrices is not 
totally ordered, the existence of the minimum is non-trivial. Put differently, 
if the loss is scalar valued increasing function of a mean square error matrix, 
then, maximally entangled states are optimal. Also, [IT] studies discrimina- 
tion of a pair of generalized Pauli matrices, and shows maximally entangled 
states minimize Bayesian error probability for any prior distributions. In case 
of qubits, they extended their result to minimax error probability |13j . Another 
example of such study is |16] , where discrimination of two unitary operation is 
discussed. They found that minimizers of Bayesian error probability and the 
error probability of unambiguous discrimination are the same. 

These results motivate the following definition: we say the input is univer- 
sally optimal for the family {Ag} ege , roughly speaking, if it is optimal for all 
the statistical inferences and for all the loss functions. (The rigorous definition 
will be given later.) We show that a universally optimal state exists (not nec- 
essarily uniquely) in case of group covariant and contravariant channels, unital 
qubit channels and some measurement families. 

To prove these results, we have recourse to the theory of "comparison of state 
families" [2 [10]; we write {pe}g e Q h c {c6»} eee if the family {pg} ge0 is more 
informative than another family {ag}g £ Q with respect to any kind of statistical 
inferences. Then, our target is to prove 

V {(Ag ® I) (p pt)} eee >=«= {(Ae ® I) (p')}eee , V 

for an input p pt- In particular, we utilize sufficient conditions for {po} >z c {o~o}, 
Proposition l2.2l and Lemma [2Tl 

Based on these results, some related topics are discussed. The first topic 
is effect of entanglement between the input space and the auxiliary space. For 
example, in |llj|12j[T3]. they study the condition that Bayes risk and minimax 
risk of discrimination of two unital qubit channels is smaller on an entangled 
state than on any separable state. In our case, in Sections|4][6] it is shown that a 
maximally entangle is universally optimal for some channel families. But there 
might be a separable state which is as good as maximally entangled states. So 
we question whether the entanglement is really needed or not. 

The second topic discussed is the existence of universally optimal states 
under the setting where the given channel can be used for several times. 

The paper is organized as follows. In Section^ besides introducing notations 
and definitions, the theory of comparison of state families is explained. In 
Sections [3l 0] and [5l universally optimal input states are established for a pair 
of unitary operations, covariant /contravariant channel families, and unital qubit 
channel families, respectively. In the proof, Proposition ^. 2l is used. In Section[6l 
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with the help of Lemma l^^l we investigate universally optimal states for some 
families which consist of a pair of measurements. In Section[71 the family of 
SU (d) is studied. In d — 2-case, it is shown, with recourse to Theorem l5.3l in 
Section^ that maximally entangled states are universally optimal. On the other 
hand, in d > 3-case, it is shown that any state is optimal for some statistical 
inferences. In Section[51 we investigate the conditions that an entangled state 
is strictly universally better than any separable states. In Section[9l universally 
optimal input states in case of iterative use of the given channel is studied. 



2 Preliminaries 

2.1 Settings, conventions and notations 

Below, Win, Hout, Hr etc. are finite dimensional Hilbert spaces, and B(Hi n ), 
for example, are the set of linear operators over T-Li n . lj„ and Ij„ denotes 
identity operator over Tim and over B (T-Lin), respectively. A channel is a trace 
preserving completely positive (CPTP, hereafter) map from B (%i n ) to B (H ut), 
and is represented by A, T , etc. with subscripts and superscripts. 

To do some statistical inference about a family {Ag} eee of channels Ag : 
B (T-Lin) — » B (Hout), a statistician prepares an input state pi n € B (Hi n <E) T-Lr), 
sends its "Hm-part to Ag, obtaining A# ® 1r (pin) as the output. To the output 
Ae <8>Ir (pin), the statistician apply a measurement with POVM M which takes 
values in decision space T> (an element of T> is usually denoted by t). Without 
loss of generality, throughout the paper, we suppose pi n is pure, and thus we 
suppose dim'Hj n = dimH^ = d. For a state vector \ip) g % in (g> "Hr, 

P^:=tr Wfl MM €B(M in ). (1) 

A system of vectors {|i)} i=1 is an orthonormal complete basis of Hi n . Abusing 
the notation, the same symbol is also used to denote an orhonormal basis of 

v %— 1 

is a maximally entangled state living in ® T-Lr. 

Given a linear map T from B (Hin) to B (Hout), its Choi-Jamilokovski's rep- 
resentation Ch (r) is defined by 

d 

Ch(T) := £ r(|i) (j\)®\i) 0'|. 

We also use the following notation: 

T c (p) :=CpCl 
Given a state p and a POVM M, denote (B) := tr pM (B). 
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When 6 C V = R m , we write 



E{M,p e ] := / tdP™(t). 
Jtev 



V[M, Pe ] 



(f #) (f - 0*) dP* f (t) 



2.2 Comparison of state families 

In comparison of input states, we have recourse to the theory of comparison of 
state families [2] [TU]. Consider a family {pe} eee of states over H and a family 
{oelgge °f states over H' . We say {pe} e6e is sufficient to {<7g}g g0 with respect 
to classical decision problems, and write {pe}g G Q h c { cr e}eee' ^ anc ^ om y if s 
for any decision space T> equipped with cr-field 21, any a- field *B over 0, any 
loss function I : x T> — » M + which is jointly measurable, any probability 
measure 7r over (0,03), and for any measurement M' over (Z>, 21) in there 
is a measurement M over (D, 21) in % such that 



0xX> 



Z 9 (t)dP£(t)d7r(0)< 



exi> 



le (t) dP% (t)dn(8) 



When {pe} eee h c We} eee and {cr e } eee h c {pe} eee holds, we write {pe} eee = 



Lemma 2.1 {pg} 0eei >z c {o~g} g£ Q ZioZds if and only if, for any measurement M 
on (12,21), there is a mea surement M' on (X>,21) such that P™ = P™ . 

Due to Lemma[531 {pg} e£ Q ^ c {o~g} ee Q has very strong implications: what- 
ever the settings are, and whatever the error measures are chosen, {pe}g g g is 
always better than {o-g} ff€ Q. For example, for any decision space T> equipped 
with cr-field 21, any loss function I : x T> — > K + such that 1$ (•) is measurable, 
the minimax risk is always smaller on {po}g^Q than on {o~g}g^Q : 

inf sup / l e (t) dP% (t) < inf sup / l e (t) dP# (t) . 
M seeJc M eeeJv 

Also, in hypothesis testing of Neyman- Pearson type, the second error probability 
of the optimal level a test is also smaller on {pe} e< -Q than on {ag} g( -Q. That 
is, letting V := {0, 1}, O U ©i = 0, and 



Ij (t) := 



1, (6 e ©o and t=l, or 6 € ©i and t = 0) 
0, otherwise 



we have 



M 



M\ I lT(t)dP™(t); K P (t)dP%{t)< 



< inf \ I Ij (t) dPjf (t) ; / l^ P (t) dP% (t) < a 
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Another example would be unambiguous discrimination: letting T> := {0, 1, 2}, 

e u ex = e, 



iY (t) 



00, (9 e 6o and t = 1, or 9 € 6i and < = 0) 

1, (9 e e and t = 2, or e 9i and t = 2) , 
0, (0 G 9 and t = 0, or e X and t = 1) 



we have 



inf 



(t) dP« (t) dTT ( 0) < inf / ^ (i) dP£ (t) dTT ( ( 

9xB M J0xV 



Lastly, let 6 C T> = W m . Then, mean square error of an unbiased estimator 
is always better on {pd}g € Q than on {o~ff} Se Q. That is, for any measurement 
M' with 

E [M\ a e ] = 9, 
there is a measurement M such that 

E [M, pg] = 9, 

V[M,pg}=V[M , ,ag}. 

Proposition 2.2 ' L 10llf there is a trace preserving positive map V such that 
T (fte) = o-e, we have {pe}g ee h c {<Tg} 9ee . 

Lemma 2.3 JTjThere is a CPTP map V with 

r - lv+) (v+l > r (|v>-> (iM) = l</>-) (v-l , 

// and only if 

Lemma 2.4 flOfSuppose 9 = {+, — }. If {pe}g e Q — c i^leee' 

UP+-SP-H! > IK-aff-Hi.Va^O. (2) 
J/ (0j and [p+,p_] = 0, tfien {pe} eee ^ C {^ieee- 

Lemma 2.5 Suppose = {+,—}, erg € S(C 2 ), and [/9 + , /?_] = 0. // {jOe} eee = 

[a + ,<7_]=0. 

Proof. By definition, {pe} ee Q di c { a o}ee@ om y ^ there is a measurement M 
with 



By Lcmma l2T4I this is equivalent to 

l|P+ - s P-\\i < 



P M _ « P M 



Vs > 0. 
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Also, by Lemma [23] {pg} ee0 = C { a e} See onl y if 

\\p+ - s P-\\i = \W+ - SfJ -\\i , Vs > 0. 

Therefore, we have 

\W+ ~ S(7 -\\i < 

Therefore, by the monotonicity of 1 1 - 1 1 ^ , there is a measurement M such that 
lk+ — « cr— Ha = 

Observe the above identity holds if and only if M = {M + , M_}, where M + and 
M— are the projector onto the positive and the negative eigenvector of cr + — s <j_ , 
respectively. Since M does not depends on s, combined with the fact that ag is 
a qubit state, we have 

[a + — s <7_, (j + — s' <r_] = 0, 
or equivalently, [er + ,er_] =0. ■ 

2.3 Comparison of input states 

Consider a family {Ag} eee of channels Ag : B (%i n ) — > B{H ou t). We say the 
input state p is universally better than p' and write p >z c p' if and only if p is 
better than p' for any statistical decision problem on {Ag} gee . More formally, 
p <Z C p' if and only if 

{(Ag ® I) (p)} eee be {(A e ® I) (p')} e6 e ■ 

If p >z c p' and p' ^ c p holds, we say p is strictly universally better than //, and 
write p p' . If p >z c p' and p' b c p holds, we write p = c p' and say that p and 
p' are universally equivalent. Obviously, 

p = c A mu (p) 

for any U € SU(H K ). 
Denote 

i? a, M, tt, p) := / Z e (t) dF A ^ I(p) (t) dTT ( fl) . 

An input state pe6 ("Hm <8> "Ht?) is said to be admissible if and only if, for a 
decision space I? equipped with a ct- field 21, a cr-field 03 over 9, a loss function 
1 : 6 x D -> M + which is jointly measurable, a probability measure 7r over 
(0,8), 

inf i? (7, M, tt, p) < inf i? (Z, M, tt, p') , Vp. (3) 

When the inequality in ([3]) is strict inequality "<", p is said to be strictly 
admissible. 



P. 



M 



sP, 



A I 



Vs > 0. 



pM _ pM 

(7+ CT_ 



Vs > 0. 
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3 A pair of unitary operations 



Let Ag = Ti/ e , = {+,— } and U+, U- € SU(d). [16] had discussed discrimi- 
nation [/+, U— and computed Bayesian error probability and error probability 
of unambiguous discrimination. After performing optimization for each case, 
they found that optimal input states are minimizers of the functional 

|V) -> (VI U\U- ® 1 |V) • (4) 

Indeed, generalizing their result, we can conclude that minimizers of Q are 
universally optimal, or optimal for any statistical inference made upon {Ae} egQ 
, e.g., statistical test of Neyman-Pearson test, or minimax error probability. 
This is an immediate consequence of Proposition ^. 21 and Lemma[573] 

4 Covariant and contravariant channels 
4.1 Universally optimal input states 

Let g £ Q, where Q is an element of compact Lie group or its discrete subgroup. 
Covariant and contravariant channels are those satisfying 

Ae o T Vg = T Vg o Ag, 

and 

Ae o T Ug = Tpj-o A e , 
respectively. Here g — > f7 ff , g V g are representations of 5. 

Example 4.1 Le£ 

A cdep. = r+(1 _0 )Tro; 

where T (p) = p T , T m is the channel which sends any input to the totally mixed 
state 1/d, and := [0, 1/ (d + 1)] C K. T/ien Ag dcp is completely positive, trace 
preserving, and contravariant. 

Example 4.2 Le£ T c be the m to n optimal pure state doner [9], which is 
covariant with U g := g® m , V g := g® n , H m := (C d )® a "\ and H out := (C*)® 3 ™. 
(Here, <g> s denotes symmetric tensor product. )Then, the channels 

Af := 9T C + (l-6) T m , 8ee-.= [0, 1] , 

are covariant. 

Example 4.3 Another example is A^ p e with Hi n ■— C rf , and H ou t '■= C d , 

j,fe=o 
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where 

e := I e ■ e^' k) > o, e(j,k) = 1 1 ' 

[ i,k=o J 

and Xd, Zd are generalized Pauli matrices defined by 

d-1 d 

X d := J2 \i) (i + II + W (II , Zd ■= J2 e ^ ± l f ) (*l 



i=i 



Xrf and Z<i satisfy 



(X d ) d = (Z d ) d = 1, e^a^Mj = 



(5) 



(6) 



A| p e is covariant with respect to 



Q = Qd ■= {e^^ (X d ) j {Z d ) k ;iJ,k = 0,1,' ■■ ,d-l} 

and = V g = g. Indeed, if Hm = H m t = C rf and U g = V g = g e being 
covariant is equivalent to be a member of |A| p e j [TP] , 

Example 4.4 alternative parameterization of A| p e is given by 



where 



E, := 



T] 1 

?7 2 



£^2 : = 







?y 2 
r? 1 

3 



i-Eti(^) o 



£ 4 := £3. 



Example 4.5 » Hi 

channels : 



rl 



1 , had defined generalized damping 



i=l 






Ft := y/p 


1 







, F 2 := y/l-p 


F3 ■■= \fp 


" 






C , F 4 := y/T 





1 





Then, |a^™| j «s covariant with respect toU g = V g = g € <?2- Indeed, |a^™ p j 
is a subset of |A| p e j. 
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Example 4.6 Let 



47 : =E T ^> 



where 



Ei := diag 



} \(d-l)/2\ 



\ 



\{d-l)/2\ 



^ -.^X^EiX^ 1 (2<i<d). 

Then A^'| s is covariant with respect to U g = V g — g G Qd '■ in fact, it turns out 
the family |a^'| s | is a subset of the family {A| p }. 



[5] and [5] had shown that a maximal entangles state \$><i) is optimal for 

the family |A| p e | and also for the family |a^™ p | in the following sense. For 

any input state pi n £ B (Hi n <g> Wr) and a measurement M over B (H ut <8> Hr) 
with 

E[M,A e ®I(p in )] = 9, (7) 
there is a measurement M' such that 

E[M',A s ®I(\$ d ) ($ d |)] = 

and 

V [M, K e ® I (p in )] = V [M', A e ® I (]*«,) ($ d |)] . 

Also, [11] studies discrimination of a pair of channels in |A^ p e |, and shows 

that \$>d) minimizes Bayesian error probability for any prior distributions. In 
case of qubits, they extended their result to minimax error probability |13j . 

The following theorem is a generalization of these results. Below, we suppose 
the representation g — » U„ occurs the decomposition 

Hin = @ H-in ) U g = ^ Ujfi , 

where Ug acts on and g — > £/g is irreducible. Also, define := 

Theorem 4.7 Consider the covariant or contravariant channel family {Ae} egQ . 
ITien, with ~ and 
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the jollowidng input state is universally optimal: 

\i>o P t):=c®\$ dtl ), (8) 
fj- 

where |$d M ) € ®H R , and c is the normalizing constant. 

Proof. We state the proof only for covariant case, since the argument is almost 
parallel for contravariant case. Below, we compose a completely positive trace 
preserving map T^, with 

I> (Afl<8l(|^opt) (ipopt\)) = Ae®I(\ip) 001), 
for an arbitrary \ip) £ W in (g> H' R , and use Proposition ^. 21 Here, 

Ag ® I(\lp opt ) (lpopt\) £ Wont ® Hr, 

Afl®I(|V) (ip\)£H ou t®n R , 

where "H^. ~ Hh- 

r^, is composed as follows; Prepare \if>) in %' in ®'H' R , where H' in — %i n - Apply 
the measurement M (defined later) jointly to "H^-part of Ag (g> IQipopt) (V'optl) 
and "H^-part oi\ip). Depending on the outcome g £ Q of M, apply Vl to 'Hout- 

To define the measurement M, we first define the the state vector in T-L R £g) 



H 

with the normalizing constant c', Ug^ being in H R ^ and ' — H-^. Then, 
the measurement M is the one which occurs state change 

c"l HoutmWR ®T {Vg] (p), 

with the probability density tr I« „ t <gi-H' <8 1\ Vg | (p). Here c" is the normalizing 
constant, and the density is considered with respect to the Haar measure dg 
such that Jg dg = 1. 

In the end, wc confirm that meets the requirement. By composition, 
is completely positive and trace preserving. Let {A K } be the Kraus operators 
of A e . Also, let H' R := ©^j^ ' and H^' - Then, after the application 

of M and obtaining measurement result g £ Q, the state will be the mixture of 
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the pure state in % ou t ® H' R , such that 

v^ 7 (l«„„ t ® ® lwj,) (A. ® Iwj, ® lwj B ® 1k' r IV'opt) |V» 

= CC 'V^(A K ®l^)0^(l^„® f/W T ® 1 K J |$d M )|V) 

= cc'V^ 7 (A K ® d„ (l Wi „ ® (<$ d Jl^, |$d„}lV) 



ju i=l 

= ccV^"(A K E/ s )® 1^|V). 
This mixture equals 

(cc')V (A e oT Us )®lK k (lV) (VI) 

= ( CC ')V (Ty s oA e )®I^ (|V) (VI). 

Therefore, applying Vjj to % out , we have Ag ® (|V) (Vl)> as desired. ■ 
4.2 On A?7 P 

[6] had shown that for A 1 T ip , p = |2) (2| 6 B (Hi n ) is optimal for mean square 
error under the constraint (|7|). Despite this fact, |2) (2| is not universally optimal 
as is shown below. Indeed, 



Therefore, 
1 

2 ' 



A?r P (|2) (2|)-,sAf m P(|2) (2|) 



= ii-e-*i+e, 



< 



A? a f p ® I (|$ 2 ) ($ 2 |) - S A?7 ® I (|$ a ) ($ 2 | 

= ^V(i-*+o 2 +4^+^|i-e-«|. 



A^(|2) (2|)-iA?7 P (|2) (2|) 



A^ ® I (|* a ) <$ 2 |) - i Af7 P ® I (|$ 2 ) <$ 2 |) 



v^2 
2 



Therefore, by Lemma HOI we have the assertion. 
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4.3 An alternative proof for {Aj^} e Q 

Given A^ p e ® I(|$d) ($d|) and p m G one can generate A| p e ® I (p in ) 

in the following manner. Measure Ag ® I(|$d) ($<z|) by the projectors onto 

C 1 d-l 

<^ -X^d (8) 1 |$d) f , and apply the unitary A^Zj 1 if (j, k) is observed. 
This composition works also for any channel family {A™} with 



A? : 



5 Unital qubit channels 

In this section, Hi n = H ou t = C 2 . Also we denote 

Y 2 := V ~1Z 2 X 2 . 

and define 



V = 



e x/ ~ Tc sin a e - V^Tb cos a 



G SU (2) . 



With p = (pi,p 2 ) (Pi +P2 = 1) and y € SU(2), let 

b P ,v> :=-^T(V|l»® |l)+v^(^|2»^|2) 
Also, let Tjjmot denote the universal not operation 



r uno t (C) = a Y2 (C), 

which is positive trace preserving but not completely positive. 
Observe that Y2 is unitary and Hermite, and that 



or equivalently, 
Lemma 5.1 Suppose 

or equivalently 



Y 2 V = VY 2 



TV 2 r unot — r unot o Ty 2 . 



Y 2 A e (C) Y 2 =A e (Y 2 CY 2 ) , 



Ae o r unot = r unot o A 

Then, the input \<& 2 ) is universally optimal. 



(9) 
(10) 

(11) 
(12) 
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Proof. To use Proposition ^. 21 we compose a trace preserving positive map T 
with 

r (Ag <g> I (|$ a > ($2|)) = Ag <g) I (|p PfV ) (^ p ,y|) 

as follows. First, apply the unitary V T to H^-part of Ag (g I (|$ 2 ) ($2!), obtain- 
ing 

A e ® I ((1 ® |*2) (*2| (1 ® ^ Tt )) 
= Ag ® I ((V ® 1) |*2) (*2| (V+ ® 1)) . 

Second, measure "H^-part by the measurement specified by the instrument 

{%/m,VT t ~m}, 

where 

M:= Pl |l)(l|+p 2 |2) (2|. 

If the measurement result is the one corresponding to y/M, then we are done. 
Otherwise, letting p' :— (p 2 ,p\), we obtain 

Ag (g) I (\tp p >,v) (<P P ',v\) 

= (A g (g) i) o (r unot (g r unot ) (|^ Pl v) (<p P ,vH) 
= (r\,not (g r unot ) o (Ag (g i) (|^ p ,v) (vp,v|) • 

So we apply r unot <g r unot , to obtain Ag ® I (|<p p ,y) (<£> P ,v|). ■ 
Any 4x4 Hermitc matrix belongs to 

spanjjjAtgB; A,S= 1,X 2 ,Y 2 ,Z 2 }. 
So is Choi-Jamilokovski's representation C/i (Ag). Since Ag is trace preserving, 

tr« out C7i (A e ) = l in . 

Therefore, Ch (Ag) is a positive element of span^TP, where 

TV :={A®B;A,B = 1,X 2 ,Y 2 ,Z 2 , if A = 1, then B = 1} 

Lemma 5.2 jlfy) holds ifCh(Ag) is an element of 

span R (TV - {X 2 ® l,y 2 (g 1,Z 2 <g 1}) , 

or equivalently, 

A fl (l) = l. 

Proof. Since 

A e (C) = tr^ in Ch (Ag) (l out (g C T ) , 
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and 



A g (ACA^) = tr nin (l <g> A T Ch (A e )l ® a) (l ® C T ) , 
AA e (C) A f = tr Kin (A <g> 1 C/i (A*) A f ® l) (l <g> C* T ) , 
ITT|) is equivalent to 



1 ® Y 2 Ch (Ag) 1 ® Y 2 = F 2 ® 1 Ch (Ag) F 2 ® 1, 
or equivalently, with W — Ch (Ag), 

W = 

■Tunot & r uno t 



(13) 



Each element olTV other than A^tgfl, l^®! and Zi®\ satishes (|T3|) . Therefore, 
we have the assertion. ■ 

Combining these lemmas, we have the following theorem. 

Theorem 5.3 Suppose Wi n — % ut = C 2 . Then, the input |$ 2 ) * s universally 
optimal if Ag is unital. 



Example 5.4 Due to [Hfy, the family {T v ; U £ SU (2)} satisfies j7| 
Example 5.5 Channel family {Ag} with 



Ch (Ag) 



i o o e 1 - y/^ie 2 

o oo o 

o oo o 

J-Yti 1 1 



satisfies (Ityl. In Kraus representation, Ag is expressed as 

2 



Ei 



i=l 
1 





AO 2 



,E 2 



6 A pair of measurements 

Let us consider a family {Ag} r, _i such that Ag : B (Hi n ) — > B (H ou t), H 



C m , and 



Ae {p)=Y J {tr P M 9 (i)} 



i u . 



(14) 



This corresponds to measurements which outputs classical data with prob- 
ability tr pMg (i). 
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Example 6.1 Suppose 

AI+ (i) M_ (i)=0, (i = 1, • • • , m) . (15) 

For example, suppose 

rankM + (£) = 1, 

M_ (i) := ^ {trM+ (t) • 1 - M+ ($)} . (16) 

T/ien, 

M_ (i) M+ (i) = Af+ (i) M_ (i) 

= z -L_{trM + (i)-M_(i)-(M_(i)) 2 } 
= 



m 1 f m m 

i=l L i=l i=l 



1 r 

{trl • 1 1}= 1. 



d- 1 



Thus, Mb)) is a special case of (Tl 

An input \ip) is universally optimal if 

y/p^M + (if Pi ,M^ {if Jpj = 0, (t = 1, • • • , m) , (17) 

where is as of {JJj. In particular, \<f>d) * s universally optimal. 
The proof is as follows. Suppose |17[ ) holds. Then 

A+®I(|V>) <V>|) A_ ® I (1-0) (V»|) 
= |i) («| ® VP^ M + (*) T Pv m - (*) T 

i=l 
= 

Therefore, A + ® I (|^>) (^>|) and A_ (g)l(\ip) (if>\) can be discriminated perfectly. 
Therefore, for any pi n € B {Hi n ® "Hi?), ttere is a trace preserving CPTP map 

r wit/i 

r (Ag <g> i (|^) (v>|)) = A e <g> i ( Pm ) (e = +, -), 

and by Proposition VZ.'A we have the assertion. 
Example 6.2 Let {C/j}j_j be unitary matrices such that 

m 

UiAU} = c (tr A) 1, (18) 

i=l 
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and define 



where 



Mg{l) := -UiMgUl 



[M+,M_]=0, 
M > 0, 
trM + = trM_ = 1. 



Then, \^d) is universally optimal. 
The proof is as follows. Observe 



[A+ ® I (|$ d > <*«,!) ,A_ ®I(|$,,> ($ d | 



rf 2 
= 0. 



1 m 



U l M+U, 



||A+®I(|<p) (^|)-aA_®I(| V ) (^DIK 



(19) 



= E||v / ^( m +w t - sM - w 

i=l 
m 

<^|^|M + (i) T -sM_(i) 

i=l 
m 

= £trp v |M + (i)-«M_(i)| r 

z=l 

= itr ^ Ui \M+ - s M_ | E/? J 

= tr p v tx \M+ - s M_ | = tr \M+ - s M_ | , 

where the inequality in the third line is true if \(p) — Therefore, by 

Lemm a\2.4\ we have the assertion. 



Example 6.3 Let 



M e (2) = 1 - Me (1) , 

M_ (2) = M+ (1) = M = diag (oi, • • • , a d ) , 
ai > a 2 > • • • > ad- 
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Then, 

HA+alW) <^|) -aA_®I(|^> m\i 

= || ^ {-si + (1 + s) M T ) ^|| x + ||^7, (1 - (1 + s) Af T ) ^\\ x 

< tr/v (|-sl + (1 + s) M T | + 1 1 - (1 + s) M T |) 

d 

= PiM.< d _s + C 1 + s ) Oil + |1 - (1 + *) Oil) 

i=l 
d 

= <Q-0,i,-t (1(1 - Oj) s - dj| + |aiS - (1 - a»)|) 

< |(1 - ai)s - ai| + |ais - (1 - oi)| , 

and the inequalities in the third and the fourth line are achieved by p$ = |1) (1|. 
Therefore, by Lemma \2.4\ \ip) = |1) |1) is universally optimal. 

7 SU(d) family 
7.1 d = 2 case 

In this subsection, we consider the family {Tjj ; U € SU(2)}. [4] had shown 
that | $2) is optimal for the mean square error with the constraint Also, as 
stated in Theorem l5.3l Section[Sl \$2) is a universally optimal state. 
Define for \ip) e (8) 

U (if>) := {U; tr p^U = 0, [7 G SU (d)} . 

Below, we consider the test between the two hypotheses, U — 1 v.s. U GU (ip). 
In other words, T> — {0, 1} and the loss function l^ is such that 

^ / 1, (u eit(i/>)) 

luW -\o, (17 = 1) ' 

J °' (UeitW)) 
1, (t/ = i) ' 

#(!) = # (0)=0,C/^W(^)U{1} 

Theorem 7.1 Consider the family {Tu > U <E SU(2)}. Then, ($2) is strictly 
universally optimal. 

Proof. Consider the loss function Z* 2 .Then, since ($2) an d f7 ® 1 | ( E>2) (t/ € 
W ($2)) are orthogonal, for any prior distribution ir, 

mini?(? <E,2 ,M, 7 r, |$ 2 » =0. 

M 

This is not the case if the input is not maximally entangled. Indeed, any 
U e SU (2) satisfy I t/ii| = 1 1/2 2 1 - Without loss of generality, suppose the Schmidt 
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basis of \ip) = -s/pi |1) |1) + y/p2 |2) |2), where p\ ^ P2 ■ Then, the inner product 
between and U £g> 1 \ip) equals 

(tp\U® l\ip) = pxU n + P2U22. 

But this cannot equal to because of p\ ^ P2 and \Un\ = 1 2 1 - Therefore, 

min#(Z* 2 ,M,7r,|^)) ^ 0. 

M 

■ 

7.2 Tests on SU (d) (d> 3) 

This subsection is devoted to the proof of the following theorem. 

Theorem 7.2 Consider the channel family {Tjj ; U € SU(<i)}, where d > 3. 
Then, any \ip) £ Tim ®TIr is strictly admissible. 

First, we introduce a series of propositions and lemmas. 

Proposition 7.3 Consider the channel family {Ty; U € SU (d)}. Then, ifU (tp)— 

mmRtft^^Aip')) > mini? (ft, M, ir, lip}) , 3tt. 

M Af 

Proof. By the definition of U (?/>), there is a projective binary measurement 
{Mo, Mi} which distinguishes \tp) and {U ®l\ip);U £U(ip)} without error. 
Therefore, by the definition of ft, 

iamR (ft,M,TT,\ip)) = 0, Vvr. 

If U E U O) - U (tp') ^ 0, then \ip') and U <g) 1 \ip') can not be distinguished 
perfectly. Therefore, 

min R (ft,M,Tr,\ip')) > 0, 3vr. 
Therefore, we have the assertion. ■ 
For x E R d , define 

lu; lu e C d ~\ \u>i\ = 1, ^2 x i UJ i + Xd = 
i=i 

Also, if 

X%1 ^ — ' ' ' — ^id ' 

4. J. / J. 4. i\ 

Xj . Xi - , x . — 1 x^ , X2 , • • • , 2^ 1 . 
The proof of the following lemma will be given in AppendixlAl 
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Lemma 7.4 Suppose d > 3, Xi, x\ > (i = 1, • • • , dj, > X)i'=2 a 't' an< ^ 
x 'i - > y^f-o If Ang (x) C Ang </ien a;' = sa; /or some s G R. 

Lemma 7.5 Lei po,pi G /B (C d ), and &e an orthonormal basis in <C d . 

Suppose an Hermitian matrix A satisfies 

(i\U^AU\i) =0,(* = 0,--. ,d) , 

for any unitary U G U (d) such that 

Q<(i\rfpjU\i)<± (i = !,■■■ ,d, j = 0,1). (20) 

Then we have 

A = 0. 

Proof. Let 

Eij := |<) 01 + |i) (i| , F tj := >/-T(|t) 01 - |j) . 
Then, there are real numbers a*, bij, and c 2 j with 

E/Ut/ = <H I*) (*| + I] (^J^ + ■ 

i i>j 

Due to (i\ U^AU \i) = 0, a, = 0. 

If £/ G U (d) satisfies (j2"0")l . then any member of neighborhood of £/ satisfies 
(gOD- Therefore, 

(z|[C/U[/,iJ] |») =0, (* = (),■•• ,d) , 
for any Hermitian matrix _ff . Also, 

[£?y,fl i ]=-2>/-T(|i> + (il), 

and, if £^£,7 = o^c> jfe = 0, 

H[3y>*w]H = 0. 

Therefore, 

(i\ [U^AU^Eij] \i) = -2V-Tc y -, 
(i| ]p^AU,Fij\ \i) = 2V=T6«. 

Hence, 6^ = = 0. After all, WAU = 0, implying A = 0. ■ 

Now, we are in the position to present the proof of Theorem l7.2l 
Proof. (Theorem l7.2|) In view of Proposition l7.31 it suffices to show that 
U (ip) CW (i/j') implies p$* — p^. 
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Suppose U (tp) C U (tp'). Let us define, for x £ M. d , 

{d d \ 

w;w £ C d , \uJi\ = l,'^Lo i x t = 0, = 1 > , 
i=l i=l J 

and, for a d x d matrix A, 

Diag (A) := (An, A 22 , ■ • • ,4w)- 

Then, 

W ( Pv> ) = {[/diag (w) [/ f ; 17 e C/ (d) , lj e ^ (Diag (J7 f p^)) } • 
Therefore, W (tp) C W (^') implies 

Ang (Diag (rf p^U)) C ^5 (Diag (U^p^U)) . 

or equivalently 

Ang (Diag (rfpjU)) C (Diag (l^p^U)) 
for all [7 € SU (d). 

Here we have recourse to Lemma f7.4| with X4 := (i\U^p^U\i) and := 

(i\Wp r U\i). Suppose x\ > Y$ =2 xf, and x i > EiU^?- Due toixWp^U = 
trW PtP'U = 1, this is equivalent to 

< (i\U f p^U \i) < l -, 

o<(i\u*p 4 ,u\i) <i, (* = i,--- ,d). 

Therefore, by Lemma lT4l 

Diag = Diag (rfppU) , 

which leads to 

Diag ([7+ (pTp-p^U) =0. 
Therefore, by Lemma f7.5[ = p^' . Thus we have the assertion. ■ 

8 Universal enhancement by entanglement 

8.1 Subfamily of {AfJ} 
Observe, in Example l4.6l a separable state 
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where \ f) R E Hr is arbitrary, is universally optimal. Combining with the fact 
that |4>d) is universally optimal (Theorem l4.7[) . we have 

iv) = m ■ (21) 

The proof is as follows. For unitary operators Ui £ U (%r) {i = 1, • • • , d), 
let Ui © ■ ■ ■ ffi Ud be the unitary operator acting on Hi n ® Hr — W ut ® Hr 
such that 

(^©•••©^)K) in |i} JJ = |i> in c/ i |j} iJ . 

Then, \iU t \f) = \i), 

(i/! © ... e £/ d ) IV) = I**) ■ 



Observe 



Therefore, 

(IV) (VI) 

A^ g ®l)(|$ d ) <**!)• 

Therefore, by proposition l2.2[ we have the assertion (j2Tj) . 

So in this case, entanglement between and Hr is not necessary. When 
an entangled state is strictly universally better than any separable state? Below, 
the condition for |$d) to be strictly universally better than any separable states 

is studied. After investigating the subfamily of |Af p e j in this subsection, we 

move to families of measurements (Exanrplcs l6.ll and l6.ip in the next subsection. 

Theorem 8.1 With G M 3 , let Ag = A| p ^ e . Then, there is a separable state 
IV) = iVm) |Vfl) with |Vm) |Vr) h c |$2), or equivalently 

{A e (|Vin) (Vm|)} eee ^ {A 9 ® I (|*a) (<f 2 |)} eee (22) 

i/ and only if 

(i) {^6»}g e Q *s on a straight line. 

(ii) If there is at least a pair 9\, 62 such that £ + := t;g 1 and £_ := £g 2 are 

distinct, £ + , £_ satisfies 

e + e = e+e = ^ 

or 
or 

where f£ := 1 - f± - £± - C± . 
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For example, jA'l™ 13 j in Example l4 . 5 1 does not satisfy (i). Therefore, 
is strictly universally better than any separable states. On the other hand, 
|A2'| s j (Exampl d4.6n satisfies the hypothesis of the theorem. Therefore, there 
is a separable state which is as good as |$2)- 

Proof. We first study the case where O = {+, — } and A + = A| p ^ and 
A_ = A| p _. and give necessary and sufficient conditions for (|22|) . 

Suppose ([22]) holds. Then, since A+ <g) I (|$ 2 ) and A_ <g> I (|$ 2 ) ($2!) 

commutes, by Lemma l2~51 A + (iV'm) (">Pin\) and A_ flr/'in) (4>in\) has to commute. 
Let r and f$ be a Bloch vector of \ipi n ) (ipin\ and Ag (\tpi n ) (ipin\)> respectively. 
Then, this means that 

7 r _ = ar + , (26) 

for a real number a. Also, 

T=fl = diag (al,a 2 s ,al) r, 

where 

al := 1-2$ - 2^|, 
^ := 1-2^-2$, 
a 3 := 1 - 2$ - 2$. 

Let us denote by p (r) the state with Bloch vector r. By simple calculations, 
we can verify 

\\p{f+) - sp(af+)|| 1 



1 












2 + 




h«||r + |M 


+ 







is non-decreasing in ||r+|| for any s > 0. Therefore, by Lemma f2.4l 

{p (r*+) , p (af+)} ^ c {p (rV) , p } , 

if and only if \\f + \\ > 1 1 r* + 1 1 . 

Therefore, we concentrate on r which maximizes || r + | = ||diag (a\, a\, a\) r 
This maximum can be achieved at r = (1, 0, 0), (0,1,0), or (0, 0, 1). Therefore, 
(|2"2"j) holds if and only if 

{p (o^, 0, 0) , p (oi, 0, 0) } h c {Ae ® I (|3 2 ) (<J> 2 |)} ee{+ ,_ } , (27) 

or 

{p (0, 0) , p (0, a 2 _, 0) } ^ c {A e ® I (|$ 2 ) ($ 2 |)} ee{+ _ } , (28) 

or 

{p (0, 0, <) , p (0, 0, oi) } > c {Ae ® I (|$ 2 > ($2|)} ee{+ ,_ } ■ (29) 
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Suppose ([2"T]) is the case. Then, in view of Lemma l2T4l we have to have 
\\p(al, 0,0) -sp(al,0,0)\\ 1 

= \e + +e + - s (£+£)\+\e + +e + -s(£+£)\ 

> \£ + - s£ + \ + \£ - s£ \ + \£ + - s£ \ + \£+- s£ \ 

= ||A+®I(|$ 2 )(*2|)-aA_®I(|$a)(*a|)|| 1 , (30) 

where 

£± :=i-£ ± -&-£±. 

On the other hand, observe 

\£ + e + s (£+£)\ < \£ -s£\ + \£ - s£\ , 

< i^-sei + i^-^-i- 

Therefore, the inequality @ is true for any s > if and only if identities in 
above two inequalities hold for any s > 0. Therefore, ([2"T]) if and only if (|2"3")l . 

Similarly, ([28]) and ([29]) holds if and only if ([24]) and ([25]), respectively. 
Therefore, in the case of = {+,—}, there is \ipi n } with ([22]) if and only if one 
of ([23]), ([M]) or (USD holds. 

Next, we treat the case where O is an arbitrary set, and Ag = Af^ g . We 
suppose that there is at least a pair 6±, # 2 such that £ + :— £g 1 and £_ :— 
£e 2 are distinct. In view of Lemma l231 ([2"2"]) holds only if Ag (IV'm) (ipin\) and 
Ag' (IV'm) (V'ml) commutes for any 8, 8'. Therefore, {Ag (\ip in ) (ipin\)}g ee is on 
a straight line passing through origin. Denoting Ag t and Ag 2 by A + and A_, 
respectively, for any 8 £ 0, there is Xg £ R such that 

A 9 (|Vin) (&n|) = AgA+ (|V> m ) (V>m|) + (1 - Ag) A_ (|^ m ) (^ m |) . (31) 

We assert (|221) holds if and only if 

{A+ (\tpin) (i/> in \),A- (\ipin) (ipin\)}e e e 

h c {A+ ® I (|$ 2 ) ($ 2 |) , A_ <g> I (|$ 2 ) (* 2 |)} (32) 

and 

A = A A+ + (l-A e )A_. (33) 

The statement of the present theorem follows immediately from this assertion. 

First, we show 'only if '. Obviously, (|2"21 implies ([3"2"]) . Also, due to ([2"2"]h for 
any positive operator F < 1 there is a positive operator F ' < 1 such that 

tr [{A e A+ + (1 - Xg) A_ - A e } ® I (|* 2 ) ($ 2 |)] F 
= tr [{A e A+ + (1 - Xg) A_ - A e } {\i/j in ) (^„|)] F' 
= 0. 

Here the second identity is due to fl3~T]) . Since F < 1 is arbitrary, we have 
Ag <g> I (|$ a ) (*a|) = {A e A + + (1 - A e ) A_} ® I (|* 2 ) ($ 2 |) . 
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Therefore, {Ag ® I (|$o) (^o|)}e e e is also on a straight line, and so is {Ag} eg0 . 
Thus we have (j33|) . 

To show the opposite, suppose (|32l) and ([55)1 holds. Then, for any measure- 
ment M, there exists a measurement M' such that 

pM _ \ pM , f-i \ \ p m 

- f A ( ,«.I(|*2><*2|) _ A ° • f A+<»I(|*2>{*2|) + \ L ~ Ae > / A_(»I(|*2><*2|) 

= x 9Px + (\ i , in) i^ in \ ) + (1 - As) Pjud^^i) 

_ pM' 

Hence, we have (|2"2"|) , and our assertion is proved. Thus, we have Theorem l8.ll 



8.2 A pair of measurements 

In this subsection, we investigate the measurement families studied in Exam- 
ples lOllfO of SectionE] 

First, in Example l6.ll \ipi n ) \ipn) h c \®d) holds if and only if its output can 
be discriminated with certainty, or equivalently, 

(ip in \ M+ (i) \i/> in ) (ip m \ M_ (i) |Vm> = 0, i = 1, • • • ,m. (34) 

Proposition 8.2 In case of Ho]) . (34\ l holds if and only if either 

M+ (») \^ in ) = (35) 

or 

M + (i) = c \ip in ) (ip m \ (c : constant) (36) 

holds for any i . 

Proof. If (ipi n \ M + (j) \ipi n ) = 0, we have (|3"5")) . On the other hand, suppose 
(ip in \M + (i) \ip m ) ^ 0. Then, for flU} to be true, (ip in \ M_ (i) \i/j in ) = has to 
hold. Therefore, by (H"6|) . 

trM + (i) = (^ in |M+(i)|^ n ). 

Since M + (i)'s rank is one, this holds if and only if (|36|) . ■ 
Finally, we investigate Example I6.2I Let 



M+ = |e*> <Ci| , M+ = ^ ft |e<) <e 4 | , 

i=l t=l 

where {|ej)}. , is an orthonormal basis oi'Hi n . Below, we assume 

f>T>->T- {37) 

Pi P2 Pd 

The proof of the following lemma is in AppendixIBl 
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Lemma 8.3 Suppose cti > 0, /3j > 0, and \3 r t\) holds. Suppose also Y^Li \li\ 2 > 
0. Then, 

ai _ Sj=i l7j| 2 «j 

/io/ds i/ and on/?/ i/ |7i| ^ and 

72 = 73 = ■ ■ ■ = Id = 0. 
Lemma 8.4 Suppose unitary matrices {Ui} i=1 satisfies 



Then, 



Y^UiAU} = {ctvA)l. (38) 

i=l 
m 

s jTu}BU l = (ctrB)l. 



Proof. By (|3"8")l . we have 



c tr Btr A = ]T tr BU l AU\ 

8=1 

m 

= ^tr [//S^A 



i=l 

Since this holds for any A, we have the assertion. ■ 

Proposition 8.5 In Examvle 1 6. 6 A suppose ai > 0, fti > 0. Also suppose |ff7| ) 
holds. Then, \ipin) \ipR) }z c \^d) is equivalent to the following: there is a sur- 
jection f : {1, • • • , m} — > {1, • • • , d}, a state vector \tp) and unimodular complex 
numbers U)i (i — 1,- ■ ■ ,d) such that 

c=\{i; f(i)=j}\ (j = l,2,-.. ,d). 
Proof. For \ijji n ) \^)r) _: c \&d) to hold, we have to have 

d 

tr \M+-sM-\ = |«i - *A| 



i=l 

i 

' , Vs > 0. 



^ m 



c 
1=1 



Therefore, by defining / properly, we have to have 



c 



/?, = - E (^"1 UiM-U} |Vi„) , (j = 1, • • • , d) . 



C *: /(i)=j 
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Thus, if / (<) = 1, 



ai = (il> in \UiM + u1 \il> in ) 

Pi {^UiM-Ul^in) 
ELl \li,3? a 3 



where 

d 

3=1 

By Lemma l5T5I then we should have 

M = |ex) • 



Thus, 



«i = - V (^„|c/ 4 M + t/;i^„) = -|{*;/« 



»:/(»)=! 



or 



Therefore, 



c=|{i;/(i) = l}|. 
- I] Ife) (i>in\ Ui = |ex> < 



i :/(»)=! 



Since by Lemma l8Tl 



- VC//|^„)(^n|^ = l 

Z" 3 fa * 



holds, we should have 



3=2 

Therefore, with / (i) > 1, we should have 

d 

WiU} Win) l 6 ^ ' 



3=2 



and if / (i) = 2, 



«2 
^2" 



(^ in | UiM+Uj |V 



i'. 

E?=2 l7j| 2 "j 

E-=2 fel 2 ^' 



2G 



Then, by Lemma l5T5I we should have 

UiUj \ip in ) = |e 2 ) . 

Therefore, using the same argument as the one derived (|39|) . (|40j) . and (|4Tj) . we 
have 

C =|{t;/(t) = 2}|, 

\ E U i\^in){^in\Ui=\e 2 ){e 2 \, 
C i:/(i)=2 



1 

C i:/(i)#l,2 

Recursively, for each j, we obtain 



. .<,;■';< n 

C i:/(i)#l : 2 »=3 



w<?7i \ifjv 



e Jd) 



c=\{i;f(i) = j}\ 

\ E Uj\ij m )^ m \U l = \e J )(e J \, 



i ■ f(i)=j 

1 d 

- E ^l^m)<V>m|^ = E l e *X e «l- 

C i:/(i)#l,2,- J »=j+X 

Thus we obtain the assertion of the proposition. ■ 

8.3 Entanglement breaking channels which requires en- 
tanglement 

In [T3] , they had shown that Bayes error probability of hypothesis testing of a 
pair of entanglement breaking channel is smaller with a maximally entangled 
input state than with any separable input states. Likewise, we point out that a 
maximally entangled state is universally optimal and strictly universally better 
than any separable state for some families of entanglement breaking channels. 
Such families of entanglement breaking channels can be composed using Theo- 
rem l8.ll and Propositions l8.2l and 18.51 

First, let us compose such family in the form of |a| p ^ j using Theo- 

rcm l8.ll Observe Af 1 ^ is entanglement breaking if and only if A| P ^(S)I C$2) (^l) 
is separable. By PPT criteria [5], this is equivalent to 

$ + e e >\e e - e e \,$+ e e >\e e - e e i , (42) 

where := 1 — £g — Q — £|. If a family{^e} eee satisfies does not satisfy the 
hypothesis of Theorem lS.ll is not true, \ A§ p \ is an example of a family of 
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channels with desired properties. In particular, if {£g} gee is not on the straight 
line, this is the case. Even if {^} ege is on a straight line with £g 1 ^ £g 2 , if no 
pair out of tfjj^, $ej£g 3 , (eJ^X and £ej$> 2 equals with each other, we also 
obtain an example of an entanglement breaking channel with desired properties. 

Second, consider POVM {M + (i)} such that constituent operators are of 
unit rank and not orthogonal with each other. Also, define POVM {M_ (i)} 
by (|16p. Then, by Proposition 18.21 the channel family {Ag} ge ^ + _j defined via 
(fT4|) has desired property. For example, consider a measurement with POVM 



M +«=2^ 



a ab 
ab b 2 



a 

—ab 



—ab 
b 2 





2a 2 - 2b 2 

M_ (i) = tr M + (i) 1 - M + (i) , 



where a > b > 0. 

Finally, by Proposition l8.5l we can add another set of examples. Observe 



J2 XffiAfazi 

i,3=0 



t 



1. 



where , are defined by ||5j). By Proposition l8.5[ if there is \ijjin) \iPr) with 
\ipin) \ipn) h c \$d), we should have 



e f(i,i) 



where / is a surjection onto {1, 
number. Therefore, with f(i',j') = 1 



'^(xizif \i> in ), 

, d} and uj'^ is a unimodular complex 



Therefore, by ((6|, there is a surjection / onto {1, 
complex number coy with 



, a 1 } and a unimodular 



- UJ i'i X d Z d l e l) 



For example, let 



\i) 



i=i 



where > and a, 7^ aj (i, j). Then, V^ |ei) is neither parallel or orthogonal 
to \e%). Therefore, the conditions indicated by Proposition l8.5l are not satisfied, 
and we have a channel family with desired property. 
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9 Iterative use of a channel 



Allowed to use given channel Ag for n times, one may send in identical n-copies 
of an input (identical repetition), or create a large entangled state in W®™ and 
send to the channels Af n (parallel repetition), or modify the input depending 
on the output of the previous use of the channel (sequential repetition). By 
definition, an identical repetition is a special case of a parallel repetition, which, 
in turn, is a special case of a sequential repetition. 

The final output state of the identical repetition with the input state lip}® 11 6 
(Win <8> W R )® n and the parallel repetition with the input state \ip n ) G (T-Li n ® Hr) 
is 

Pl e = {A g ® I (|V) (VI)} 8 " G B ((U out ® U R f n ) , 

and 

P « f , e = Af « ® I (|v") (V"l) g 6 ((H out ® n R f n ) , 

respectively. To describe the final output state of sequential repetition, we intro- 
duce a series of Hilbert spaces {"Hm,i}" =1 , {Hou^i}™^, H% where T-L in ,i — H% n 
and H ou t,i — %out (i = 1) • • ■ , n), and a series of completely positive trace pre- 
serving maps {Ti}™^ from B (rl ou t,i ® H R ) to B (rli n ,i+i ® Here, dimen- 
sion of 7^ is finite and large enough (in fact, dim H R = (d\mrL in ) n+1 {dxm% ou tf 
is enough.) With the initial state |V) G %i n , l ® H R , the final output state of 
the sequential scheme is 

p™ >e := (A 9 I) o T n _a ■ • • o (A e ® I) o T 2 o (A 9 ® I) o T x o (A g ® I) (|V) (VI) , 
eB{U out ,n®H n R ) 

to which the measurement M„ is applied. 

Theorem 9.1 Let {Ag} S( -Q be covariant or contravariant channels. Then, the 
universally optimal identical repetition strategy achieves the figure of merit that 
can be achieved by the universally optimal sequential repetition strategy. Here 
the optimal input state is IVopt)®", where |Vopt) is as of (E). 

Proof. By Proposition ^. 21 we only have to compose a CPTP map T n with 

p^ e = f n ({A 9 ®I(|V op t)(Vo pt |)f"), 

where, with H R ~ %i n , 

{A e ® I (|Vo P t) (V opt \)} 9n G B ((U out ® ft*)®") • 

The composition off" is as follows. Define W in t , H' R with the same dimen- 
sion as T-Lin,i, T~L R which would have used in the sequential repetition protocol 
resulting the final state p™ f e . Prepare |V) in H' in 1 ® T~L' R , and apply T, which 
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is composed in the proof of Theorem l4.7l jointly to H' in -ppart of \ip) and Hr- 
part of Ag (g) I (\ipopt) (ipo P t\) S & (Hout ® producing (Ag <g) I) (ip\) in 

the space H ou t ® Then apply Ti, producing Ti o (Ae ® I) (V>|) in 

T-L' in 2 ® 'H^'. Repeating this for n times, composition of T n is done. ■ 



Proposition 9.2 Consider the family {Ag} ge ^ j in Examvles \6.1\ and \ 6.°A 

Then, a universally optimal input state for parallel repetition is (identical 
repetition). 

Proof. If Ag is in the form of dHJ, or of UT5J, so is Af n . Therefore, |$ d ») = 
l^d) 8 ™ is optimal. ■ 

This proposition motivates following definition of classical adaptation: Given 
Af n , we divide this into Af ni , Af l \- ■ ■ ,Af ™ m , with £™ r* = n. We know 
that preparing input state separately in each block \tpi) 6 T-ifn 1 , ^2) € Wf™ 1 ,■ ■ ■ ,|V>n 
Hfn™ can achieve the same as the optimal parallel repetition. So the question 

arises whether we can do better by choosing t/jJ 3 \ depending on the data 
x^ 1 = (x\,X2, ■ ■ ■ ,Xj-i) from measurements M\, Mf , • • • applied to 

l^i)) V*2 ^» " ' ) V'J-i )) respectively. (Note here the measurement at jth step 
is depends on the previous data sequence a; 3 " 1 = (xx, X2, • • ■ , %j—x)-) 



Theorem 9.3 Consider a channel family in Example \Kl\ or \6.Sl Then, classical 
adaptation does not improve identical repetition. 

Proof. LctAP := |Mi, M? , ■ ■ ■ ,Mf~ 1 } , i and ft := [|^i> , ipf) >■ ■ • , V^ 1 )} . 1 - 

Also, p g g Im ? m (t) is the probability of choosing the decision t when sequence 

of adaptive measurements M m and inputs ip m are chosen. Then, with the prior 
distribution n (9), the minimized risk is 



M"\i/> 



inf V inf VVtt(6») x 

J> fl) tf»-i,VT»-i (z m - X )tr {A®"™®I( feB .-x)M* m ~ 1 (t)}l s (t), 

Let us denote the marginal distribution of x m ~ x and conditional distribution of 
9 given x m_1 by 
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respectively. Then, the minimized risk is 

= - Sft , E P«,M-^ (* m_1 ) x 

M m-l ; ^,m-l ^ i 

inf E^m- 1^-1 (^ m - 1 )tr{A®"»®I(p a!m -i)M» m " 1 (t)}/ 9 (t). 

By definition of a universally optimal state, infimum over p x m-i can be achieved 
by p x m-i — |<j>d) ($ ( j| (8 ™ m i which does not depends on the data sequence x m_1 . 
Therefore, we can merge the last two steps into one; depending on x n 
send p xm -2® ($ d |®" m into A®™"- 1+Tl ™ <g> I and apply M^_\ 2 and M^" 
successively. Repeating this process, we can get rid of classical adaptation. 
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A Proof of Lemma [7\4] 

Lemma A.l Suppose xt > (i = 1, • • • , d) Then, Ang (x) ^ if and only if 

d 

x\<J24- (43) 



Proof. Obviously, we only have to prove 'if. If d = 3, the assertion follows 
from triangle inequality. Suppose the assertion is true for d — 1, or for any 
V = (2/1,2/2, • • • ,2/d-i) with 

yi > y% > ■ ■ ■ > yd-i 

and 

2/i < 

i=1 

we have Ang (y) =^ 0. Suppose x^_ 1 + x^ d < x j and 

d d-2 
i=2 i=2 

hold. Then, 

2/i := ari, 2/2 := %2, ■ ■ ■ ,Vd-2 ■= Xd-2, Vd-i = x\_ x + x\, 

Ang (y) ^ by the hypothesis of induction. Therefore, Ang (x) ^ holds. On 
the other hand, suppose K_i > x\. Observe x\ > x\ > ■ ■ ■ > x^ d > yields 



J2 x i- x d-i+ x i 
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Therefore, with 

Vi ■= ^-1 + x tv = x ii y3=xi,-' ,Vd-i = % d -2' 

by the hypothesis of induction, Ang (y) ^ holds. Therefore, Ang (x) 7^ 
holds. After all, we have the assertion. ■ 

Lemma A. 2 Suppose d — 3 and x% > 7 x\ > X2 > X3. Then, if xi < X2 +^3, 
Anp (a;) = {(wi, w 2 ) , (S7T, u^)} . 

Ifxi =x 2 + x 3 , 

Ang(x) = {(-1,1,1)}. 
Lemma A. 3 Suppose d > 4, x\ > a; 2 > • • • > Xd > 0, and 

d 

si<^Xi. (44) 

i=2 

Then, Ang (x) is a (d — 3)- dimensional smooth manifold. 
Proof. Let 

d-l 

z k := 2J XiUi + x d , (k = 3, • • • , d - 1) , 

Zrf := Xd, r k := \z k \ , 
r := (7-3,7-4, • • • ,r d -i) ■ 

Suppose f is fixed. Then, length of each edge of each triangle z\z k ^\z k (k — 
3, • • • , d) is decided, and uj can take only finite possible values. Also, the map 
from f to tu is smooth. Therefore, we use r as a local coordinate of Ang (x). 
Let A (x) be the set of all rs such that uj € Ang (x). Below, we show the interior 
A(x)° of A(x) is non-empty. Then, the assertion of the lemma immediately 
follows. 

An element of A (x) is constructed as follows. We first fix ujd-i, Td—i, then 
0Jd-2, Td-2, ■-, 0Jk+i, rk+i- We choose r k so that the following (|46|) and (|47| 
are satisfied (then, uik can take only one of two possible values.); First, by 

rk = \zk+\ + x k uj k \ , (45) 

existence of Uk is equivalent to 

\r k +i - x k \ < r k < r k+ i + x k . (46) 

Also, for uik-i,- ■ ■ ,wi to exist, by Lemma fA.ll it is necessary and sufficient that 

fe-i fe-i 
x\ - < r k < ^Xi- (47) 

i=2 i=l 
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Therefore, A (x) is the set of rs with P5|) and (|4"T)) for each = 3,- • • ,d — 1. 
Therefore, A (a;)° is the set of all rs with 

|r fc+ i - x fc | < r k < r k+ i + x k , (48) 

and 

fc-i fc-i 

i=2 i=l 

for each fc = 3,- • • ,d — 1. 

This A (x)° is non-empty due to the following reasons. By (|44l) . we have 

d-2 

Xi~y^Xi< Td+Xd-l- 
i=2 

Also, by Xd-2 > a^d-i > > 0, we have 

d-2 

Vd - Xd-i\ < ma,x{r d ,x d -i} < x d -2 < ^Xj. 

»=i 

Therefore, combining these, the overlap of the set 



r d -i ; xi 



d-2 d-2 ~j 

^Xi < r d -i < y^ y xt > 

i=2 i=l J 



and the set 

{r<j-i ; |r<j - 2Bd-i| < fd-\ <r d + 
is not empty. Recursively, suppose r k with (|48|) and (|49l exists. Then, by (|48|) . 

fc-2 fe-2 
r k - x k -i < r k < E x ii x i ~ E Xl < rfe + :z;fe - 1 - 

i=l i=2 

Also, by x\ > X2 > ■ ■ ■ > Xd and Xi > 0, 

k-2 

x k -i - r k < x k _i <y^ Xj. 

i=l 

Therefore, 

fc-2 fe-2 

\r k - x k -i\ < 22 X U x l - X! 1 ' < rfc + ^-l' 

t=l »=2 

Therefore, there is r/c_i with 

\r k - X k -i\ < r fc _i < r k + x k -i 

and 

fe-2 fc-2 

Xl - 2J 3C< < ^fc < 2J Xi ' 

i=2 z=l 

Therefore, there exists r such that (|48|) and (|49|) hold for each fc, or equivalently, 
A (a;) is non-empty. ■ 
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Lemma A. 4 Suppose d > 4 and x% > (i = 1, • • • , dj. TTien Ant/ (a;) contains 
an element Q such that the set {u>\, • • • , ^d—i} contains at least three distinct 
elements. 

Proof. Suppose uj\, ui2, ■ •• , Wd-i can take at most two distinct values for any 
element of Ang (x). Let / be a subset of {1, • ■ ■ ,d— 1} , and uji — V] (i E I), 
uii = v'j (i g 7 C ). Then, (V/, t/j) is decided by Lemma lA.21 Moving I over all the 
subsets of {1, • • ■ , d — 1}, (i/j, z/j) can move over discretely many values. This 
contradicts with Lemma lA.3l ■ 



Now, we are in the position to state the proof of Lemma [74l 
Proof. (Lemma l7.4p When d = 3, the assertion is trivial. So suppose d > 4. 
Let 

where 77^ (t) are smooth functions. (Such smooth parameter t exists due to 
Lemma lA.31 ) Then, 



d-l 

i=i 



-lr/i 



Id 



Differentiating by t, 



d-i 



d-l 



d-l 



-l?7i 



i=l 



Due to Lemma [A.3[ with 

fj :-- 



i=i 



(50) 



(jlv 



'/2: 



,Vd-i 



span {77; ([50)) holds} is d — 3 dimensional. Therefore, its orthogonal complement 



pd-1 



is at most two dimensional. By ([501) . 

(xi cos 771, • • • cos^-i ), (xisin??!,' 

(a^ cos 771 , • • • , cos 77^-1 ), {x\ sin 771 , • 



, a; d _i sinr^-i ), 
, a^_i sin 77^-1), 



are orthogonal to span {77; ([50]) holds }. By Lcmma lAT4l we can choose 77.; so 
that the set {771,772,- •• ,^-1} contains at least three distinct values. There- 
fore, [x\ cos 771, • • • , Xd—i cos Tjd—i ) and (x\ sin 771 , • • • , £d-i sin 77^-1 ) are lin- 
early independent, thus can be chosen as a basis of orthogonal complement 
of span {77; ([5H|) holds}. Therefore, there are oi,«-- ,04 with 



(xi cos 77! , • • • ,x' d _ 1 cos?7 d _i ) = ai (x : cos 771, 
(x[ sin 7/1 , • ■ • , a;^! sin 77^1) = a 3 (xi cos 771, ■ 



, x d _ 1 cos 77 d _! ) + a 2 (x 1 sin 774 
, x d - 1 cos 7i rf _ 1 ) + a 4 (xi sin 771 , 



,a; d _ism77 d _i), 
,x d -i sin?7 d _i ). 
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Therefore, 

(ai cos rji sin rji + a 2 sin 2 rji — a 3 cos 2 r\i — 04 cos rji sin r\i) Xi 

ai-a 4 a 2 + a 3 a 2 - a 3 \ 

sm lr\i cos lr\{ H \xi = (J. 



2 

Since Xi > 0, we have 

(ai - 04) sin2r?j - (a 2 + a 3 ) cos 2^ + a 2 - 03 = 0. 

Therefore, a\ — 04 = a 2 + 03 = 0, since the set {771 , 772 , ■ ■ • , Vd-i} contains at 
least three distinct values by Lemma [A.4l Therefore, 

a\ - a 4 = a 2 + a 3 = a 2 — a 3 = 0, 

which means 

(cos 7/1 x[, ■ ■ ■ ,cosry d _i x'^) = a^cosrjx x\,--- ,cosry d _i Xd-i), 
(sin 771 x[ , ■ • • ,wa.r)d-ix' d _ 1 ) = ai(sinr?i xi, ■ ■ ■ , sin?7 d _i x d -i). 

Since one of cos rji and sin?7i is always non-zero, we have the assertion. ■ 



B Proof of L omnia [8J3J 



Proof. (LemmaHinD 

Observe 

a-d-i _ Ei=d-i \lt\ 2 a i __ \ld\ 2 Pd ( Orf-i _ (*d 
Therefore, if Ei=d-i 17* | 2 ^ ^, we have 



^ Et d -iW 2 A 



Next, observe 



^ 2 Et d - 2 l7i| 2 ft Et rf - 2 l7d 2 ft V^- 2 Et d -il7i! 2 ^ 



> 



Ei=d-1 lTi| 2 A / Q!d-2 Cld-1 



Therefore, if Ej=d- 2 It* | 2 7^ 0, we have 



ad-2 > Ei=d_ 2 l7i| 2 "i 



jd - 2 " Et,- 2 W 2 A 
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Recursively, if Ei=2 \li\ 2 ^ 0, we have 



& ~ Et 2 N 2 ft' 



Observe 



_ Ef=i \li\ 2 a* 

& Eli N 2 ft 

= Et 2 N 2 A /V _ Ei= 2 M 2 <*A 
~EtiN 2 ft V^ 1 Et 2 N 2 /?J 

> Ei=2 l7»| 2 A ff*l _ "2 \ 

"EtiN 2 A U 



implies Ei= 2 17*1 = Thus we have the assertion. ■ 



Therefore, due to (|3TJ) 



/3i Etil7i| 2 ft 
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